Community Aware Influence Maximization on Large Scale Networks Using Mapreduce
نویسنده
چکیده
Influence maximization problem is a well known problem to find the top-k seed users who can maximize the spread of information in a social network. The primary concern is monte carlo simulations method is suffering with scalability issues while the selection of seed users .It takes days to find potential seed users in large datasets. In this paper, we propose a highly scalable algorithm for identifying Influential nodes on largescale graph using the Map Reduce framework. We perform a combined community detection algorithm and consider the most influential users in the community as the candidate for the top-k seeds. This approximation allows us to divide the whole graph into multiple sub graphs that can be processed independently. Then, for each sub graph, a Map Reduce based algorithm is designed to identify the minimum-sized influential vertices for the whole graph. This original approach contrasts with previous influence propagation models, which did not use similarity opportunities among members of communities to maximize influence propagation. The performance results show that the model activates a higher number of overall nodes in contemporary social networks, as compared to existing landmark approaches.
منابع مشابه
Adaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments
Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...
متن کاملTipTop: Exact Solutions for Influence Maximization in Billion-scale Networks
In this paper, we study the Cost-aware Target Viral Marketing (CTVM) problem, a generalization of Influence Maximization (IM), a well-known problem in viral marketing. CTVM asks for the most cost-effective users to influence the most relevant users. Instead of approximating the problem as done in the literature, we attempt to offer exact solutions. As the problem is in NP-hard, clearly the exac...
متن کاملScalable community detection in massive social networks using MapReduce
detection in massive social networks using MapReduce J. Shi W. Xue W. Wang Y. Zhang B. Yang J. Li In this paper, we present a community-detection solution for massive-scale social networks using MapReduce, a parallel programming framework. We use a similarity metric to model the community probability, and the model is designed to be parallelizable and scalable in the MapReduce framework. More i...
متن کاملExploring Community Structures for Influence Maximization in Social Networks
Since the surge of the popularity of social network, recently, there has been a tremendous wave of interest in the investigation of influence maximization problem. Given a social network structure, the problem of influence maximization is to determine a minimum set of nodes that could maximize the spread of influences. Nowadays, due to the dramatic size growing of social network, the efficiency...
متن کاملAnalyzing Patterns in Large-Scale Graphs Using MapReduce in Hadoop
Analyzing patterns in large-scale graphs, such as social networks (e.g. Facebook, Linkedin, Twitter) has many applications including community identification, blog analysis, intrusion and spamming detections. Currently, it is impossible to process information in large-scale graphs with millions even billions of edges with a single computer. In this paper, we take advantage of MapReduce, a progr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017